Speech emotion recognition based on listener-dependent emotion perception models

نویسندگان

چکیده

This paper presents a novel speech emotion recognition scheme that leverages the individuality of perception. Most conventional methods simply poll multiple listeners and directly model majority decision as perceived emotion. However, perception varies with listener, which forces their single models to create complex mixtures criteria. In order mitigate this problem, we propose majority-voted framework constructs listener-dependent (LD) models. The LD can estimate not only listener-wise emotion, but also by averaging outputs Three models, fine-tuning, auxiliary input, sub-layer weighting, are introduced, all inspired successful domain-adaptation frameworks in various processing tasks. Experiments on two emotional datasets demonstrate proposed approach outperforms recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

Speech Emotion Recognition Based on Sparse Representation

Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of domains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature...

متن کامل

Speaker dependent emotion recognition using speech signals

This paper examines three algorithms to recognize speaker’s emotion using the speech signals. Target emotions are happiness, sadness, anger, fear, boredom and neutral state. MLB(Maximum-Likelihood Bayes), NN(Nearest Neighbor) and HMM(Hidden Markov Model) algorithms are used as the pattern matching techniques. In all cases, pitch and energy are used as the features. The feature vectors for MLB a...

متن کامل

Speech emotion recognition using hidden Markov models

This paper introduces a first approach to emotion recognition using RAMSES, the UPC’s speech recognition system. The approach is based on standard speech recognition technology using hidden semi-continuous Markov models. Both the selection of low level features and the design of the recognition system are addressed. Results are given on speaker dependent emotion recognition using the Spanish co...

متن کامل

Speech emotion recognition using hidden Markov models

In emotion classification of speech signals, the popular features employed are statistics of fundamental frequency, energy contour, duration of silence and voice quality. However, the performance of systems employing these features degrades substantially when more than two categories of emotion are to be classified. In this paper, a text independent method of emotion classification of speech is...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: APSIPA transactions on signal and information processing

سال: 2021

ISSN: ['2048-7703']

DOI: https://doi.org/10.1017/atsip.2021.7